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Abstract 

Jet clustering algorithms are widely used to analyse hadronic events in high en- 
ergy collisions. Recently a new clustering method, known as 'Cambridge', has been 
introduced. In this article we present an algorithm to determine the transition val- 
ues of y cu t for this clustering scheme, which allows to resolve any event to a definite 
number of jets in the final state. We discuss some particularities of the Cambridge 
clustering method and compare its performance to the Durham clustering scheme 
for Monte Carlo generated e + e~ annihilation events. 
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1 Introduction 



In collider physics, clustering of the experimentally accessible hadronic final states is used 
to determine the underlying parton structure of events. In e + e~ annihilation the widely 
known JADE |I| and Durham jet algorithms have become indispensable in this process, 
permitting a wide range of important tests of QCD, allowing refined measurements of 
electro-weak physics with hadronic final states and being used in searches for new physics. 

Recently a new jet clustering scheme, known as Cambridge, has been introduced ||. 
This scheme is a modification of the original Durham fc^-clustering scheme. The Cam- 
bridge algorithm is designed to minimise the formation of spurious 'junk-jets', jets formed 
from a multitude of low transverse momentum particles, unrelated to the underlying par- 
ton structure. 

For all the above mentioned algorithms, clustering of the final state is performed iter- 
atively and is terminated at a clustering specific resolution scale, generically denoted by 
the resolution parameter y cut . By changing the value of y cut , the final state is resolved 
into a varying number of jets. The Cambridge algorithm involves three basic components 
in this iterative process. It uses an ordering variable, u^-, a test variable, jjij, and a recom- 
bination procedure. In JADE-type jet clustering algorithms only two basic components 
are involved, since the ordering variable, Vij, and the test variable, t/ij, are identical. 

In this note we review the Cambridge finder and discuss some of its experimental pe- 
culiarities. In terms of computing this algorithm is more complex compared to the JADE 
and Durham algorithms. Due to the distinction between test and ordering variables, the 
sequence of clustering now depends on the value of y CVLt . We show that the jet multiplicity 
obtained with this algorithm is not monotonically decreasing for increasing y cut , and that 
for some events it is impossible to resolve a certain jet multiplicity. Therefore the concept 
of the 'transition values in y cut ' has to be defined more precisely. The transition value at 
which the event classification changes from n-jets to m-jets, when going to larger values 
for y cut , will subsequently be referred to as y m *~ n value. 

Next we developed a fast algorithm to obtain the transition values y m ^ n for the 
Cambridge finder. Using this algorithm, we compare results for Monte Carlo generated 
e + e~ — > qq events between the Durham and Cambridge finder. We compare their per- 
formance in determining the size of the hadronization corrections. As another example, 
we determine the performance for hadronic decays of W + W~ production at LEP2 H. 
Finally we give our conclusions and cite an address to download our FORTRAN code. 



2 The Cambridge algorithm 

In JADE-type jet clustering algorithms one iteratively combines particles to form final 
state jets. First one introduces a 'test variable' yy. The pair of two objects i and j with 
smallest value for y^ is selected and its objects are combined or the iteration is terminated 
when jjij > y cut for all pairs of objects. For the JADE and Durham algorithms, the test 
variables y^ and y9 are defined respectively as 
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where E± and Ej denote the energies of particles % and j and 9ij their opening angle. 
Note that we normalise the values of y^ and to the visible energy, E vis , which is the 
sum of energies for all particles observed in the final state. The second ingredient is the 
recombination procedure. Normally the i?-scheme is taken, for which the four-momentum 
of the resulting object is simply the sum of the four-momenta of the two objects Pi and 

Pj ' 

In contrast to this the Cambridge algorithm involves three basic components to form 
the final state jets. The algorithm starts from a table of iV bj primary objects, which is 
the set of the particles' four-momenta. It starts clustering the pair of particles with the 
smallest opening angle, using the ordering variable The test variable y? which is 
identical to the one for the Durham algorithm, decides when the iterative procedure is 
stopped. It is subsequently denoted by y^. The algorithm proceeds as follows: 

1. If only one object remains, store this as a jet and stop. 

2. Select the pair of objects i and j that have the minimal value for their ordering 
variable, Vij, with Vij — 2(1 — cos 6^). 

3. Inspect the test variable yij. 

• If yij < y cnt then combine i and j in a new object using the ^-scheme. Remove 
particles i and j from the table of objects that remain to be combined and add 
the new object with four-momentum p^ + pj. 

• If Vij > Ucut then store the object i or j with the smaller energy as a separated 
jet and remove it from the table. The higher energetic object remains in the 
table. 

Removing the softer of two resolved objects, as described in the last step, is called soft 
freezing. It prevents the softer jet from attracting any extra particles, thereby reducing 
non-intuitive clustering effects. 

3 The Cambridge algorithm: an example 

In order to test the various clustering algorithms, we generate Monte Carlo e + e~ — > qq 
events at = 91.2 GeV with the PYTHIA event generator ||. The generation includes 
parton showering ('parton-level'), and subsequent fragmentation and decays of the final 
state ('hadron- level'). The parameters of the Monte Carlo event generator are adjusted 
in order to provide an optimal description of large samples of hadronic Z° decay data || . 

To illustrate the differences between the Cambridge and Durham finders we present in 
Figure the three- momenta of a typical event projected onto the xy-plane. The underlying 
parton level is shown in the figure by the thick arrows and consists of a quark q recoiling 
against a qg system, with the gluon being relatively soft. 
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Figure 1: Display, projected onto the xy-plane, of a qq generated event at \/s = 91.2 
GeV. The lengths of the arrows are proportional to the energies of the objects. The 
jet-axes are indicated by the arrows, left at the parton level, right at the hadron level. 
The particle and jet three-momenta are shown for the Cambridge and Durham algorithms 
separately. Particle association to jets is indicated with various line-styles. The length of 
the parton and jet axes are scaled down by a factor of four. 



At the hadron level, the event is clustered again to three final state jets, both with 
the Durham and Cambridge algorithm. The final jets are indicated by thick arrows, and 
the association of particles to the three jets is indicated by various line styles. 

In this example one clearly observes the positive effect of soft freezing on the hadroniza- 
tion corrections. In the Cambridge algorithm the soft gluon jet is separated and classified 
as a final state jet. Most particles in the hemisphere are assigned in an intuitive way to 
the quark jet. The three final jets closely resemble the underlying parton structure. In 
contrast to this, in the Durham algorithm more particles are clustered around the soft 
gluon, so that the gluon jet becomes even more energetic than the quark jet. It is obvious 
that in this example the final state found for the Cambridge algorithm resembles the 
parton structure more than the Durham algorithm. 

As an illustration of some of the peculiarities of the Cambridge algorithm, in Figure 
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Figure 2: Example of the jet multiplicity, iVj et as function of y cnt for two Monte Carlo 
generated events. The dash-dotted line is for the Durham algorithm, the full line for the 
Cambridge algorithm. The points mark the y m ^ n transition values. 



we present the jet multiplicity as function of y cut for two events. In the figures the 
dashed lines correspond to the Durham algorithm, whereas the full lines correspond to 
the Cambridge clustering. For the Durham algorithm, the jet multiplicity is decreasing 
monotonically for increasing y cut . In addition, each event can be resolved into each jet- 
multiplicity iVjet, With 1 < iVj e t < iV bj. 

In the Cambridge finder the situation is a little more complex, as can be seen in the 
same figure. In the left plot of Figure [2] an example is given where 

• the jet multiplicity is not monotonically decreasing for increasing y cnt . 

In this example, at the resolution y cnt ~ 10~ 4 ' 3 , the jet-multiplicity decreases from 6 
to 5 to 4 when y cut increases, but then increases from 4 to 5 again. At y cut ~ 10~ 41 
the multiplicity decreases from 5 to 4. At y cut ~ 10~ 3 ' 8 a situation occurs where the 
jet multiplicity does not change from being 4, but the four final state jets change their 
four-momenta. The jet configuration of the two 5-jet states and three 4-jet state are all 
different. 

In the right plot of Figure ^| we show an example where 

• it may not be possible to resolve the event into a certain n-jet final state. 

In this example, at y cnt ~ 10~ 3 ' 3 , the event changes from being classified as a 6 jet event 
to a 3 jet event. For this event, it is impossible to choose a value for y cnt such that the 
event is resolved into a 4 or a 5 jet configuration. 

As a last 'peculiarity' of the Cambridge jet clustering we consider the particle to jet 
association. For the JADE and Durham algorithms, when crossing a transition value in 
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y cnt towards higher values, two of the jets are merged into one new jet while all other jets 
are left untouched. The resulting new jet consists of exactly all particles that belonged 
to the two merged jets, and the subjet history of jets can be traced unambiguously. In 
the Cambridge algorithm this need not be the case, since the sequence of recombination 
may be different for different values of y CU f It thus can happen that the particle contents 
of a jet at a given value for y cut does not match the sum of the particle contents of two 
resolved jets at lower values of y cnt . 

For many applications it is essential to obtain the transition values y m ^~ n . For example, 
in previous studies of e + e~ annihilation data the value of y 2 ^ 3 was analysed in order to 
obtain a s (Q 2 ) 0. In other studies all events were classified as four || or five || jets and 
their angular correlations were studied in order to probe the non-Abelian nature of QCD. 
In current studies at e + e~ annihilation energies reached by the LEP2 programme Q, 
events also have to be clustered to four jets in order to determine the W-boson charac- 
teristics in the hadronic decays of W + W~ pairs. Therefore, an algorithm to obtain all 
transition values y m ^ n with full information on the particle to jet association is highly 
desirable. 



4 Transition values of y cu t 

In the JADE and Durham algorithm, the sequence of clustering of an event can be deter- 
mined once and completely, and is independent of the value of y cut . From this clustering 
information about jet multiplicities, four-momenta and jet-particle association, can subse- 



quently be retrieved for any value of y cn t- This is the strategy used in the KTCLUS [nj and 
YKERN ]TT] packages. The final jet configuration is identical for all values of y cn t between 
two subsequent transition values. At the transition value y n ^ n+1 ^ the event flips from 
a n + 1-jet to a n-jet configuration. The transition values for the JADE and Durham 
algorithms are ordered in y cut . Using the transition values one can select a value for y cut 
such that the event is resolved into the required number of jets. 

In contrast to this, the clustering sequence in the Cambridge algorithm depends on the 
value of y cut because it distinguishes between ordering and testing variables. It is therefore 
no longer straightforward to calculate the transition values. In general, at the transition 
values y m ^ n the event can flip between a ra-jet configuration to a m-jet configuration 
where n and m are not necessarily consecutive. As it is important to obtain the values 
for y m ^ n , it was suggested in || to perform a binary search in y cut to determine these 
transition values, by repeated evaluation of the clustering. This proposal is not completely 
satisfactory since such a search has an intrinsic limited precision, might skip over several 
transition values and becomes very computing time intensive. 

We have developed a method to determine the transition values of y cut for the Cam- 
bridge finder exactly, as follows. While performing the clustering at a particular value 
of y cut , denoted by y- mit , we keep track of the maximum value of yij, between any two 
objects i and j encountered in this process, with y^ being always smaller than y- m i t . By 
construction this maximum value, which we denote by y™ ax , is smaller than y- m it- We 

now note that for any value of y cut £ ylT x , yimt ) , the Cambridge algorithm will follow 



the same clustering sequence. Only when the cluster algorithm is performed with a value 
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y cnt smaller than y™ ax , the condition y^ > y cut is satisfied at least once more and the 
subsequent clustering sequence may change completely. The value yf- ax is therefore one of 
the y cut transition values. Note that the clustering may also change completely for values 
of y cut larger than y init . 

These observations can be utilised to scan the complete region of y cut . We therefore 
start by clustering the complete event to a one-jet configuration by chosing y cnt = 1 in 
the first step. After this step one iteratively repeats the clustering to calculate smaller 
and smaller values of y™ ax at which the clustering changes, and thereby calculates smaller 
and smaller transition values. The process terminates if either the number of resolved 
jets equals the number of input objects or if the desired number of jets is resolved. To 
summarise: 

1. Start with value y init = 1 and set y cut = y init . 

2. Perform the Cambridge jet clustering for iV bj objects. During the clustering, keep 
track of all values of yij between all objects i and j, and determine their maximum 
value, y™ ax . 

3. Store the value of y™ ax , the number of jets, n, their four-momenta and the jet-particle 
association. The clustering for y cut > y™ ax is now completely determined. 

4. The algorithm stops if: 

• The number of resolved jets equals the number of input objects, n = iV bj. 
Then the event is classified completely and the algorithm necessarily stops. 

• The desired number of jets or a preset lower limit in y cnt is reached, and the 
algorithm is stopped. 

5. Set y cut = y™ ax and go to step 2. 

Once this process has been performed, all information about the clustering is accessible 
without any appreciable additional computing time. The total amount of computing time 
is proportional to the desired jet-multiplicity. For example, to study four jet final states 
with the Cambridge finder requires approximately four times as much computing time 
compared to the Durham or JADE algorithms. 

Instead of the top-down approach for which the clustering starts at y cut = 1 as ex- 
plained above, a bottom-up approach is in principle also possible. One may implement 
the bottom-up approach by starting the clustering at the lowest value y cnt = 0. For 
iV bj given at the start of the clustering, one finds the pair of objects with smallest value 
Vij (corresponding to the pair closest in angle) and determines the corresponding value 
for yij. Then the two possible cases are considered: one in which the softer object is 
frozen, the other in which the two objects are combined. In both cases the number of 
objects that remain to be combined is reduced by one. This combinatorical procedure is 
subsequently continued, and all possible clustering sequences are listed. The procedure 
terminates when only one object remains. 
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From the corresponding values for y^, saved for each step, one can deduce the final 
transition values y m ^ n and the jet configuration associated to them. Note that the number 
of possible clustering sequences is proportional to 2 N ° h i +1 , which limits the practical use 
of the bottom-up approach. 



5 Monte Carlo results 

Jet finder comparison and hadronization corrections 

With the transition values y m ^ n defined both for the Cambridge and Durham algorithms, 
we compare, as an example, the values for y 2 ^ 3 . In [0 similar studies have been per- 
formed to compare the performance of the Durham and JADE algorithms. In all the 
following we will define the region with the highest value for y cut as the nominal y cu t 
region. Here, in Figure |3|a we show the correlation of the Cambridge and Durham algo- 
rithms for y 2 ^ 3 at the parton level, at the end of the PYTHIA parton shower. For most 
of our generated events, the obtained values for y 2 ^ 3 are identical for the Cambridge and 
the Durham algorithms (approximately 75% of the events are found on the line in the 
figure). For a small fraction of events, the value obtained with the Cambridge algorithm 
is smaller compared to the value for the Durham algorithm. At the hadron level, as shown 
in Figure |3|b, the values at low y 2 ^ 3 obtained using the Cambridge algorithm are smaller 
for almost all events, but become similar for the two algorithms for increasing values of 
y 2 ^ 3 . At the hadron level, approximately 15% of the events have identical values for y 2 ^ 3 
for both algorithms. 

Next, in Figure |^a and|^b, we compare the hadronization corrections for the Cambridge 
and Durham algorithms. We present the correlation between the transition values y 2 ^ 3 
calculated at the hadron and at the parton level, for both. The line indicates the ideal 
case for which equal values for y 2 *~ 3 at both levels are found. For the Durham algorithm 
the difference in y 2 ^ 3 at the parton and hadron level is small. When going to lower y 2 ^ 3 
values, the distribution broadens and shifts toward smaller y 2 ^ 3 values at the hadron 
level. For the Cambridge algorithm, at high values of y 2 ^ 3 the parton and hadron level 
correlation is similar to the one for the Durham algorithm. Whereas, when going to 
smaller values for y 2 ^ 3 , the values at the hadron level get increasingly larger with respect 
to the parton level values. The width of the distribution is similar to that for the Durham 
algorithm. In order to quantify the differences, we calculated the mean of the logarithmic 
ratio of the y 2 ^ 3 values for the parton level and the hadron level: this value equals 
0.232 ± 0.002 for the Durham algorithm, and 0.257 ± 0.002 for the Cambridge algorithm, 
which indicates that the overall hadronization corrections for the Durham algorithm are 
~ 10% smaller than for the Cambridge algorithm. Note however that the hadronization 
corrections do not only depend on the jet algorithm but also on the hadronization model 
used. 

The mean hadronization corrections can be studied more directly, as a function of y cnt , 
from plots as presented in Figures [5]a and |5|b. In Figure |5|a we show the mean of the log- 
arithmic ratio of the values y 2 ^ 3 for the parton and hadron level, (^og 10 (y 2 ^ 3 on /yl^, on )j, 
as a function of the transition value at the parton level. When calculating the mean 
deviation between hadron and parton level for each value y 2 ^ 3 , the contribution to the 
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Figure 3: Monte Carlo generated events at ^Js = 91.2 GeV using PYTHIA ||/. In a) we 
present the correlation between y 2 ^ 3 calculated using the Cambridge and y 2 ^ 3 calculated 
using the Durham algorithm, at the parton level. Note that the majority of events have 
identical values of y 2 ^ 3 for both algorithms. In b) the same is shown for the hadron level. 




log 10 y (Hadron) log 10 y (Hadron) 

Figure 4: Monte Carlo generated events at yfs = 91.2 GeV using PYTHIA ||/. In a) we 
present the correlation between y 2 ^~ 3 calculated at the parton level and y 2 ^ 3 calculated at 
the hadron level, using the Durham algorithm. In b) the same is shown for the Cambridge 
algorithm. 



9 



-4 -3 -2 -1 -4 -3 -2 -1 

J parton / §10^ J parton / 

Figure 5: In a) we show for both the Cambridge and the Durham algorithms the mean 
logarithmic ratio between y 2 ^~ 3 calculated at the parton and the hadron level, as function 
of y 2 ^ 3 at the parton level, using the PYTHIA event generator. Identical values for y 2 ^ 3 
are represented by the full horizontal line. The dashed lines correspond to a deviation 
between the parton and hadron level by a factor of two. In b) we present the absolute 
value of the mean as calculated in a). The three regions indicated in both hgures by I, 
II and III are discussed in the text. 



hadronization corrections for many events may cancel. To exclude effects due to can- 
cellation we present in Figure |5|b the size of the hadronization corrections. We show 
the mean absolute difference of y 2 ^~ 3 calculated on the parton and at the hadron level, 
{^Abs (log 10 (?/p^ 3 on /^^on)))> as a function of y 2 ^ 3 calculated at the parton level. 

To compare the performance of the two jet finders we distinguish in Figure |5|a and |5|b 
three regions in y 2 ^ 3 , denoted by /, II and III. In region J, for values of y 2 ^~ 3 above 
10 -2 , the hadronization corrections are small and comparable for both algorithms. A 
fraction of about 37% of our generated events belongs to this region. 

Region // is defined for values of y 2 ^ 3 between 10~ 3 ' 2 and 10~ 2 . In this region differ- 
ences between the two algorithms occur. The mean deviation for the Durham algorithm 
reaches a maximum of about 20%, and vanishes at y 2 ^ 3 ~ 10~ 2 ' 8 , as can be seen in 
Figure |5]a. However, Figure [5]b shows that this decrease is due to cancellations and that 
the absolute hadronization corrections increase at y 2 ^ 3 ~ 10~ 2 ' 8 . For the Cambridge 
algorithm a different behaviour is observed. The mean deviation reaches a maximum of 
100% for this algorithm, at y 2 ^ 3 ~ 10~ 2 ' 6 , implying large hadronization corrections. The 
absolute hadronization corrections for the Cambridge algorithm, as shown in Figure [5|b, 
reach a maximum at about 10~ 2 ' 6 , and then decrease until the value for the Durham al- 
gorithm is reached. In the whole region //, where about 49% of our generated events can 
be found, hadronization corrections for the Cambridge algorithm are significantly larger 
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Figure 6: In a) we present the relative jet production rates R n , as a function of y cut , 
for the Durham algorithm, using the PYTHIA event generator. The full lines with the 
points correspond to the hadron level and the dashed line correspond to the parton level. 
For i?5 we summed all contributions for jet multiplicities of 5 and larger. In b) the same 
is shown for the Cambridge algorithm. 



than for the Durham algorithm. 

In region III, for values of y 2 ^ 3 below 10~ 3 ' 2 , the figures show that the hadronization 
corrections are large for both algorithms and that they increase rapidly towards smaller 
values of y 2 ^ 3 . The corrections for the Cambridge algorithm are smaller compared to the 
Durham algorithm. However, only about 14% of our generated events can be found in 
region 

Our analysis of the hadronization corrections for y 2 ^~ 3 shows that the Cambridge al- 
gorithm performs clearly better only in the region of low y 2 ^ 3 values (region III). This 
region contains 14% of the events and the corrections there are large for both algorithms. 
In all other regions the Durham finder performs equally well (region I) or even signifi- 
cantly better (region II). These two regions contain a fraction of 85% of our generated 
e + e~ — > qq events. These basic tendencies of the hadronization corrections are also found 
when studying the transition value y 3 ^ 4 . Note that for very low values of y cnt the ap- 
proximations used for the implemention of QCD in PYTHIA might not give a reliable 
description of the hadronization process. Therefore, for very low values of y cn t the jets 
returned by the Cambridge algorithm may correspond closer to the underlying parton 
structure than can be shown in these Monte Carlo studies. 

Classical tests of QCD rely on relative production rates for multijet hadronic decays, de- 
fined as R n = o n jo to t ||nj- In Figures |6|a and []b we present the relative production rates 
for two, three, four, and five or more jet final states, for the hadron level and the parton 
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level. For these figures we used the same set of e + e~ — > qq events as before. In Figure ^|a 
the performance for the Durham algorithm is shown. For all y cnt values between one and 
approximately 10~ 2,8 the hadron and parton level agree reasonably well. When going to 
smaller values of y cut the curves for the two levels increasingly deviate. For the Cambridge 
algorithm the hadronization corrections are larger for the region in y cut where the Durham 
algorithm performs well, between y cut of one and approximately 10~ 2 ' 8 . However, when 
going to lower values of y cut the differences between hadron and parton level are, for most 
jet multiplicities, smaller than for the Durham algorithm. 

To summarise our investigations of the hadronization corrections, we conclude that 
the Durham algorithm provides smaller hadronization corrections for a large region in 
y cut- 
in 0, the hadronization corrections for the mean jet multiplicity, (nj et ) = J2T n R-n, 
was studied. There it was found that hadronization corrections for the Cambridge and 
Durham algorithms are small for values of y cnt > 10~ 32 . Our Figures |5|b show that for 
these values of y cnt the hadronization corrections for each jet production rate, R n , are 
sizable for the Cambridge algorithm, whereas for the Durham algorithm they are small. 
The small hadronization corrections found for the Cambridge algorithm in the study of 
the mean jet rate (rij et ) are due to fortuitous cancellations in the individual jet production 
rates. 

Multiple and impossible jet multiplicities 

As already indicated, the transition values y m ^ n in the Cambridge algorithm need not be 
the transition between two consecutive jet-multiplicities. Several intervals in y cut may lead 
to the same jet-multiplicity, and they have in general different jet four-momenta. Secondly, 
it need not always be possible to cluster the event to any required jet multiplicity. 

In order to determine the frequency that this might occur, we generated for Figure [F] 
e + e _ — > qq Monte Carlo events with full hadronization, at y/s = 91.2 GeV. The full 
points present the fraction of events that have multiple regions in y cut with the same jet 
multiplicity, as a function of the jet-multiplicity. For our generated events, for example, 
about 2.2% have multiple regions in y CVLt that lead to a four-jet final state, albeit with 
different jet four-momenta. In the same figure the open points show the fraction of events 
were the indicated number of jets could not be resolved. For example, in about 2.5% of 
events no four-jet configuration could be found. 

The figure shows that the fraction of impossible jet multiplicities and multiple jet 
multiplicities increases with increasing n, reaches a maximum at around n = 12, and 
decreases again. The generated events have a mean total multiplicity of 44.2, and 95% of 
the events have a multiplicity larger than 25. Note that both distributions are naturally 
limited by the input number of four- momenta in each event. 

As another example we generated hadronic decays of W + W~ pairs at LEP2 [|| us- 
ing PYTHIA: e+e" -> W + W~ -> qq'q"q'\ at y/s = 184 GeV. Information about the 
kinematics of the two Ws can be obtained by forcing the hadronic final state to four jets. 

Using the Cambridge finder, we find that about 0.9% of the events have multiple 
regions in y cnt with four final state jets. For those events one therefore has the freedom 
to select the set of jets with the larger y mt values, or the set with the smaller y cut values. 
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Figure 7: The full points show, as a function of n-jet, the fraction of events for which 
multiple n-jet final states could be reconstructed. This is done for a set of e + e~ — > qq 
generated events. The open points show, as a function of n-jet, the fraction of events for 
which no n-jet final state could be reconstructed. All points are normalised to the overall 
number of generated events. 



Clearly the selection which corresponds closer to the four primary partons is preferred. In 
Figure |8|a we compare the mean opening angle between the jets and primary four partons, 
and in Figure §b the mean absolute energy difference between the jets and the primary 
partons. It can be clearly seen that in both cases the resolution is better for the fraction of 
99% of events in which only one four-jet configuration is found. For the small fraction of 
events where two four-jet configurations were found, the jet configuration with the lower 
values of y cn t matches the four primary partons better than the one with larger values of 
y C ut, which can be explained by the following observation. In the majority of events for 
which the Cambridge algorithm returned two four jet configurations the appearance of 
hard gluon radiation in the parton shower was observed. Detailed inspection revealed that 
the hard gluon, radiated from a quark-pair originating from one W, points in the direction 
of a quark originating from the other W. The configuration with the low value of y cut 
correctly separates the gluon from this quark by the mechanism of soft-freezing, whereas 
they are merged for the configuration with the larger value of y cut . The correspondence 
between partons and jets is therefore better in the configuration with the lower value of 

Vent- 
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Figure 8: Monte Carlo generated W + W~ events at yfs = 184 GeV. In a) we present 
the mean of the opening angles between the momenta of the four partons and of the four 
jets at the hadron level. The shaded histogram shows the events for which one four-jet 
configuration was found. The open histograms correspond to the set of events were the 
Cambridge algorithm returned two four-jet configurations. The one with the dashed line 
corresponds to the configuration with the higher values of y cnt , whereas the one with the 
full line corresponds to the configuration with the lower values of y cut . In b) the same is 
shown, but for the mean absolute energy difference between the four parton and hadron 
jets. 



6 Conclusions 

In this note we review the Cambridge jet clustering algorithm, as was recently introduced 
in ||. We show some of its particularities for Monte Carlo generated events. Firstly, 
the algorithm may find several regions in y cnt with identical final state multiplicity, but 
different jet four-momenta. Secondly, for some events it is impossible to resolve a certain 
jet multiplicity. Both these properties are absent in the JADE and Durham algorithms. 

We propose a fast, new algorithm that is able to determine the transition values for y cut , 
based on the YCLUS package. All transition values, jet multiplicities, jet four-momenta and 
the jet to particle associations are derived and stored, and can be subsequently inferred 
for all values of y cnt without any substantial additional computing time. 

Using this algorithm we determine the hadronization corrections of e + e~ — > qq gen- 
erated events according to PYTHIA, by comparing parton and hadron level values for 
y 2 ^ 3 , both for the Durham and Cambridge algorithms. This comparative study of the 
two algorithms is completed by a presentation of the relative jet production rates. For a 
large interval of y CVL t values the hadronization corrections for the Cambridge algorithm are 
found to be significantly larger than for the Durham algorithm. However, in the region 



14 



of very small values of y cut (y cut < 10~ 3 " 2 ), the hadronization corrections are large, but 
better under control for the Cambridge algorithm. Note that for very low values of y cut 
the reliability of the comparative Monte Carlo studies is limited due to the fact that for 
these values of y cu t the approximations used for the implementation of QCD in PYTHIA 
might not give an appropriate description of the hadronization process. 

Further, we present for the Cambridge algorithm the fraction of events for which 
certain jet multiplicity could never be resolved, or could be resolved multiple times. Four 
jet final states were explicitly studied in hadronic decays of W + W~ events. The large 
fraction of events where just one four jet configuration was found has better energy and 
angular resolution than the small fraction of events with multiple four jet configurations. 

Fortran code, containing our CKERN routines to obtain the y cut transition values, can 
be obtained from the World-Wide Web at 

http : //wwwcnl . cern. ch/~stanb/ ckern/ ckern .html . 
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